METHOD AND APPARATUS FOR IMPROVED TEMPERATURE CONTROL 

IN ATOMIC LAYER DEPOSITION 
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CROSS-REFERENCE TO RELATED APPLICATIONS 
5 [0001] This application claims the benefit of U.S. Provisional Application 
No. 60/251,795, filed December 6, 2000, US. Provisional Application No. 
60/254,280, filed December 6, 2000 and U.S. Utility Applications Nos. 
09/812,285, 09/812,352, and 09/812,486; all filed March 19, 2001. 

BACKGROUND OF THE INVENTION 

10 Field of the Invention 

[0002] The present invention relates to the field of film deposition, and more 
particularly, to a method and apparatus for improving and enhancing 
temperature control in atomic layer deposition (ALD). 

Description of the Background Art 

15 [0003] The present invention relates generally to the field of advanced thin 
film deposition methods commonly used in the semiconductor, data storage, flat 
panel display, as well as allied and other industries. More particularly, the 
present invention relates to improved atomic layer deposition whereby the 
kinetics of the adsorption of the first precursor and the subsequent reaction with 

20 the second precursor are decoupled. By decoupling, we mean causing the 
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reaction (i.e., adsorption) of the first precursor to occur at a different temperature 
state than the temperature state required for the reaction with the second 
precursor. More importantly, methods and apparatus for improved temperature 
control in ALD are disclosed that can switch between these two thermal states in 
5 rapid succession. 

[0004] The disadvantages of conventional ALD are additionally discussed 
in copending applications with the same assignee entitled ''Sequential Method 
For Depositing A Film By Modulated Ion-Induced Atomic Layer Deposition 
(Mil- ALD)'', "System and Method for Modulated Ion Induced Atomic Layer 

10 Deposition (MIL ALD)", and "Continuous Method For Depositing A Film By 

Modulated Ion-Induced Atomic Layer Deposition (MIL ALD)" which are hereby 
incorporated by reference in their entirety and may be found as copending utility 
applications, Application Nos. 09/812,285, 09/812,352, 09/812,486 respectively. 
[0005] As integrated circuit (IC) dimensions shrink and the aspect ratios of 

15 the resulting features increase, the ability to deposit conformal, ultra-thin films 
on the sides and bottoms of high aspect ratio trenches and vias becomes 
increasingly important. These conformal, ultra-thin films are typically used as 
"liner" materials to enhance adhesion, prevent inter-diffusion and /or chemical 
reaction between the underlying dielectric and the overlying metal, and promote 

20 the deposition of a subsequent film. 

[0006] In addition, decreasing device dimensions and increasing device 
densities has necessitated the transition from traditional CVD tungsten plug and 
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aluminum interconnect technology to copper interconnect technology. This 
transition is driven by both the increasing impact of the RC interconnect delay on 
device speed and by the electromigration (i.e., the self-diffusion of metal along 
interconnects, thereby affecting reliability) limitations of aluminum based 
5 conductors for sub 0.25 jim device generations. Copper is preferred due to its 
lower resistivity and higher (more than 10 times) electromigration resistance as 
compared to aluminum. A single or dual damascene copper metallization 
scheme is used since it eliminates the need for copper etching and reduces the 
number of integration steps required. However, the burden now shifts to the 
10 metal deposition step(s) as the copper must fill predefined high aspect ratio 

trenches and /or vias in the dielectric. Electroplating has emerged as the copper 
fill technique of choice due to its low deposition temperature, high deposition 
rate, and potential low manufacturing cost. 

[0007] Two major challenges exist for copper wiring technology: the barrier 
15 and seed layers. Copper can diffuse readily into silicon and most dielectrics. 
This leads to electrical leakage between metal wires and poor device 
performance. An encapsulating barrier layer is needed to isolate the copper from 
the surrounding material (e.g., dielectric. Si), thus preventing copper diffusion 
and/ or reaction with the underlying material (e.g., dielectric. Si). In addition, the 
20 barrier layer also serves as the adhesion or glue layer between the patterned 

dielectric trench or via and the copper used to fill it. The dielectric material can 
be a low dielectric constant , i.e., low-k material (used to reduce inter- and intra- 
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line capacitance and cross-talk) which typically suffers from poorer adhesion 
characteristics and lower thermal stability than traditional oxide insulators. 
Consequently, this places more stringent requirements on the barrier material 
and deposition method. An inferior adhesion layer will, for example, lead to 
5 delamination at either the barrier-to-dielectric or barrier-to-copper interfaces 
during any subsequent anneal and /or chemical mechanical planarization (CMP) 
processing steps leading to degradation in device performance and reliability. 
Ideally, the barrier layer should be thin, conformal, defect free, and of low 
resistivity so as to not compromise the conductance of the copper metal 

10 interconnect structure. 

[0008] In addition, electroplating fill requires a copper seed layer, which 
serves to both carry the plating current and act as the nucleation layer. The 
preferred seed layer should be smooth, continuous, of high purity, and have 
good step coverage with low overhang. A discontinuity in the seed layer will 

15 lead to side wall voiding, while gross overhang will lead to pinch-off and the 
formation of top voids. 

[0009] Both the barrier and seed layers which are critical to successful 
implementation of copper interconnects require a means of depositing high 
purity, conformal, ultra-thin films at low substrate temperatures. 
20 [0010] Physical vapor deposition (PVD) or sputtering has been adopted as 
the preferred method of choice for depositing conductor films used in IC 
manufacturing. This choice has been primarily driven by the low cost, simple 
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sputtering approach whereby relatively pure elemental or compoimd materials 
can be deposited at relatively low substrate temperatures. For example, 
refractory based metals and metal compounds such as tantalum (Ta), tantalum 
nitride (TaN J, other tantalum containing compounds, tungsten (W), tungsten 
5 nitride (WNJ, and other tungsten containing compounds which are used as 
barrier /adhesion layers can be sputter deposited with the substrate at or near 
room temperature. However, as device geometries have decreased, the step 
coverage limitations of PVD have increasingly become an issue since it is 
inherently a line-of-sight process. This limits the total number of atoms or 

10 molecules which can be delivered into the patterned trench or via. As a result, 
PVD is unable to deposit thin continuous films of adequate thickness to coat the 
sides and bottoms of high aspect ratio trenches and vias. Moreover, 
medium/high-density plasma and ionized PVD sources developed to address 
the more aggressive device structures are still not adequate and are now of such 

15 complexity that cost and reliability have become serious concerns. 

[00111 Chemical vapor deposition (CVD) processes offer improved step 
coverage since CVD processes can be tailored to provide conformal films. 
Conformality ensures the deposited films match the shape of the underlying 
substrate, and the film thickness inside the feature is uniform and equivalent to 

20 the thickness outside the feature. Unfortunately, CVD requires comparatively 
high deposition temperatures, suffers from high impurity concentrations which 
impact film integrity, and have higher cost-of-ownership due to long nucleation 
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times and poor precursor utilization efficiency. Following the tantalum 
containing barrier example, CVD Ta and TaN films require substrate 
temperatures ranging from 500 °C to over 800 and suffer from impurity 
concentrations (typically of carbon and oxygen) ranging from several to tens of 
5 atomic % concentration. This generally leads to high film resistivities (up to 
several orders of magnitude higher than PVD), and other degradation in film 
performance. These deposition temperatures and impurity concentrations make 
CVD Ta and TaN unusable for IC manufacturing, in particular for copper 
metallization and low-k integration. 

W 10 [0012] Chen et al (''Low temperature plasma-assisted chemical vapor 
deposition of tantalum nitride from tantalum pentabromide for copper 

J metallization", J. Vac. Sci. Technol. B 17(1), pp. 182-185 (1999); and "Low 

^ temperature plasma-promoted chemical vapor deposition of tantalum from 

P 

Y} tantalum pentabromide for copper metallization'', J. Vac. Sci. Technol. B 16(5), 

U 15 pp. 2887-2890 (1998)) have demonstrated a plasma-assisted (PACVD) or plasma- 

enhanced (PECVD) CVD approach using tantalum pentabromide (TaBrj) as the 
precursor to reduce the deposition temperature. Ta and TaN^ films were 
deposited from 350 "^C to 450 ""C and contained 2,5 to 3 atomic % concentration of 
bromine. Although the deposition temperature has been reduced by increased 
20 fragmentation (and hence increased reactivity) of the precursor gases in the gas 
phase via a plasma, the same fragmentation leads to the deposition of unwanted 
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impurities. Gas-phase fragmentation of the precursor into both desired and 
undesired species inherently limits the efficacy of this approach, 
[0013] Recently, atomic layer chemical vapor deposition (AL-CVD) or 
atomic layer deposition (ALD) has been proposed as an alternative method to 

5 CVD for depositing conformal ultra-thin films at comparatively lower 

temperatures. ALD is similar to CVD except that the substrate is sequentially 
exposed to one reactant at a time. Conceptually, it is a simple process: a first 
reactant is introduced onto a heated substrate whereby it forms a monolayer on 
the surface of the substrate. Excess reactant is pumped out. Next a second 

10 reactant is introduced and reacts with the first reactant to form a monolayer of 
the desired film via a self -limiting surface reaction. The process is self-limiting 
since the deposition reaction halts once the initially adsorbed (physi- or chemi- 
sorbed) monolayer of the first reactant has fully reacted with the second reactant. 
Finally, the excess second reactant is evacuated. The above sequence of events 

15 comprises one deposition cycle. The desired film thickness is obtained by 
repeating the deposition cycle the required number of times. 
[0014] In practice, ALD is complicated by the painsta]<ing selection of a 
process temperature setpoint wherein both: 1) at least one of the reactants 
sufficiently adsorbs to a monolayer and 2) the surface deposition reaction can 

20 occur with adequate growth rate and film purity. If the substrate temperature 
needed for the deposition reaction is too high, desorption or decomposition of 
the first adsorbed reactant occurs, thereby eliminating the layer-by-layer process. 
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If the temperature is too low, the deposition reaction may be incomplete (i.e., 
. very slow), not occur at all, or lead to poor film quality (e.g., high resistivity 
and/or high impurity content). Since the ALD process is entirely thermal, 
selection of available precursors (i.e., reactants) that fit the temperature window 
5 becomes difficult and sometimes unattainable. Due to the above-mentioned 
temperature-related problems, ALD has been typically limited to the deposition 
of semiconductors and insulators as opposed to metals, 

[0015] Continuing with the TaN example, ALD of TaN films is confined to a 
narrow temperature window of 400 °C to 500 °C, generally occurs with a 
5 10 maximum deposition rate of 0.2 A/cycle, and can contain up to several atomic 
% percent of impurities including chlorine and oxygen. Chlorine is a corrosive, can 

P attack copper, and lead to reliability concerns. The above process is unsuitable 

■ for copper metallization and low-k integration due to the high deposition 

temperature, slow deposition rate, and chlorine impurity incorporation. 



S3 



15 [0016] In conventional ALD of metal films, gaseous hydrogen (H^) or 
elemental zinc (Zn) is often cited as the second reactant. These reactants are 
chosen since they act as a reducing agent to bring the metal atom contained in 
the first reactant to the desired oxidation state in order to deposit the end film. 
Gaseous, diatomic hydrogen (HJ is an inefficient reducing agent due to its 

20 chemical stability, and elemental zinc has low volatility (e.g., it is very difficult to 
deliver sufficient amounts of Zn vapor to the substrate) and is generally 
incompatible with IC manufacturing. Unfortunately, due to the temperature 
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conflicts that plague the ALD method and lack of kinetically favorable second 
reactant, serious compromises in process performance result. 
[0017] In order to address the limitations of traditional thermal or pyrolytic 
ALD, radical enhanced atomic layer deposition (REALD, US Patent 5,916,365) or 
5 plasma-enhanced atomic layer deposition has been proposed whereby a 
downstream radio-frequency (RF) glow discharge is used to dissociate the 
second reactant to form more reactive radical species which drives the reaction at 
lower substrate temperatures. Using such a technique, Ta ALD films have been 
deposited at 0.16 to 0.5 A/cycle at 25 °C, and up to 1.67 A/cycle at 250 °C to 450 

10 Although RE ALD results in a lower operating substrate temperature than all 

the aforementioned techniques, the process still suffers from several significant 
drawbacks. Higher temperatures must still be used to generate appreciable 
deposition rates. However, such temperatures are still too high for some films of 
significant interest in IC manufacturing such as polymer-based low-k dielectrics 

15 that are stable up to temperatures of only 200 °C or less. RE ALD remains a 
thermal or pyrolytic process similar to ALD and even CVD since the substrate 
temperature provides the required activation energy for the process and is 
therefore the primary control means for driving the deposition reaction. 
[0018] In addition, Ta films deposited using REALD still contain chlorine as 

20 well as oxygen impurities, and are of low density. A low density or porous film 
leads to a poor barrier against copper diffusion since copper atoms and ions have 
more pathways to traverse the barrier material. Moreover, a porous or under- 
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dense film has lower chemical stability and can react undesirably with overlying 
or underlying films, or with exposure to gases commonly used in IC 
manufacturing processes. 

[0019] Another limitation of REALD is that the radical generation and 
5 delivery is inefficient and undesirable, RF (such as 13.56 MHz) plasma 

generation of radicals used as the second reactant such as atomic H is not as 
efficient as microwave plasma due to the enhanced efficiency of microwave 
energy transfer to electrons used to sustain and dissociate reactants introduced in 
the plasma. Furthermore, having a downstream configuration whereby the 

10 radical generating plasma is contained in a separate vessel located remotely from 
the main chamber where the substrate is situated and using a small aperture to 
introduce the radicals from the remote plasma vessel to the main chamber body 
significantly decreases the efficiency of transport of the second radical reactant. 
Both gas-phase and wall recombination will reduce the flux of desired radicals 

15 that can reach the substrate. In the case of atomic H, these recombination 

pathways will lead to the formation of diatomic H^, a far less effective reducing 
agent. If the plasma used to generate the radicals was placed directly over the 
substrate, then the deposition of unwanted impurities and particles can occur 
similarly to the case of plasma-assisted CVD, 

20 [0020] Finally, ALD (or any derivative such as REALD) is fundamentally 
slow since it relies on a sequential process whereby each deposition cycle is 
comprised of at least two separate reactant flow and evacuation steps which can 
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occur on the order of minutes with conventional valve and chamber technology. 
Significant improvements resulting in faster ALD are needed to make it more 
suitable for commercial IC manufacturing. 

SUMMARY OF THE INVENTION 
5 [0021] A method for depositing a film on a substrate in a chamber 
comprising adjusting a temperature of said substrate to a first temperature, 
introducing a first reactant gas into said chamber, adsorbing substantially at least 
one monolayer of said first reactant gas onto said substrate, evacuating any 
excess of said first reactant gas from said chamber, adjusting a temperature of 

10 said substrate to a second temperature, introducing a second reactant gas into 
said chamber to react with said first reactant gas to produce said film on said 
substrate, and evacuating any excess of said second reactant gas from said 
chamber; and adjusting a temperature of said substrate to a third temperature. 
[0022] A system for depositing a film on a substrate in a chamber 

15 comprising a means for adjusting a temperature of said substrate to a first 
temperature, a means for introducing a first reactant gas into said chamber, a 
means for adsorbing substantially at least one monolayer of said first reactant 
gas onto said substrate, a means for evacuating any excess of said first reactant 
gas from said chamber, a means for adjusting a temperature of said substrate to a 

20 second temperature, a means for introducing a second reactant gas into said 
chamber to react with said first reactant gas to produce said film on said 
substrate, a means for evacuating any excess of said second reactant gas from 
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said chamber, and a means for adjusting a temperature of said substrate to a 
third temperature. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0023] FIG. 1 shows the relative timing diagram of the sequence for the 
5 improved ALD method incorporating two (or more) discrete temperature states; 
[0024] FIG, 2 shows the relative timing diagram of an alternative sequence 
for the improved ALD method incorporating two (or more) discrete temperature 
states; 

[0025] FIG. 3 is the ALD system schematic incorporating a lamp array for 
10 rapidly heating, and a chilled electrostatic chuck (ESC) for rapidly cooling, the 
substrate; 

[0026] FIG. 4 is the ALD system schematic incorporating a mechanically- 
scanned laser (coupled with wafer rotation) for rapidly heating, and a chilled 
electrostatic chuck (ESC) for rapidly cooling, the substrate; and 
15 [0027] FIG. 5 shows the relative timing diagram of substrate temperature 
response to irradiation and state-dependent control of backside gas pressure. 

DETAILED DESCRIPTION OF THE INVENTION 

[0028] The present invention resolves the ALD temperature dilemma 
20 revolving around the use of a single, fixed substrate temperature setpoint as the 
principal means of controlling or driving the deposition reaction. The present 
invention does this by allowing one part of the ALD process sequence (e.g.. 
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adsorption of the first reactant) to occur at a first temperature (typically lower) 
while allowing another part of the ALD process sequence (e.g., reaction between 
the second reactant with the adsorbed first reactant) to occur at a second 
temperature (typically higher). In such a fashion, the first temperature can be 
5 chosen to be a lower level such that decomposition or desorption of the adsorbed 
first reactant does not occur, and the second temperature can be chosen to be of a 
higher level such that comparably greater deposition rate and film purity can be 
achieved. More importantly, the invention relates to methods and apparatuses 
for improved temperature control in ALD that can switch between these two 

10 thermal states in rapid succession. Via these methods and apparatuses, the 
limitations in precursor choice can be resolved while improving process 
performance and deposition rate. In particular, precursor choice can be 
expanded to include metals — a highly desirable class of materials previously not 
viable with conventional ALD. 

15 [0029] Since ALD is by definition a slow process, introducing a second 

temperature state would typically compete with the desire to increase processing 
speed, i.e., the ALD flow-evacuate-flow-evacuate sequence would take even 
longer. This would be especially true in a conventional, isothermal hot-wall or 
resistively heated pedestal reactor system commonly used for ALD. This is 

20 because the reactor or pedestal must be heated to a higher temperature and then 
cooled to a lower temperature, which can take several minutes, or greater, due to 
the large thermal masses involved. Since each deposition cycle results in a film 
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thickness of at most one monolayer, the process would be extremely slow (much 
slower than even conventional ALD and its derivatives) and highly unfavorable 
for IC device manufacturing. However, methods exist to rapidly impart energy 
into a substrate (which may be a ''bare" substrate, e.g., a silicon wafer before any 
5 films have been deposited, or it may be a substrate which may already have had 
one or more films deposited on its surface), either globally or in a focused or 
otherwise localized fashion, causing a transient, rather than quasi-static, change 
in substrate temperature. The substantial process benefits of employing such 
methods, in particular improved adsorption and stability of the first reactant 
10 plus substantially increased deposition rate, significantly outweigh the increased 
complexity, 

[0030] FIG, 1 shows a sequence for an improved ALD method incorporating 
two (or more) discrete temperature states. In the variant of the method shown in 
FIG. 1, the substrate temperature is ramped 134 during the evacuation 124 of the 

15 first reactant Note that the time axis is not to scale. FIG. 2 shows an alternative 
sequence for an improved ALD method incorporating two (or more) discrete 
temperature states. In the variant of the method shown in FIG. 2, the substrate 
temperature is ramped 234 during the flow 212 of the second reactant. Again, 
note that the time axis is not to scale, 

20 [0031] An improved ALD sequence incorporating the aforementioned 
invention is as follows: 
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1. First exposure 100, 200 : A substrate heated (or cooled) to a first 
temperature, 132, 232, is exposed 102, 202 to a first gaseous 
reactant, allowing a monolayer of the reactant to form on the 
surface. 

2. First evacuation: The excess reactant is removed by evacuating 124, 
224 the chamber with a vacuum pump. An inert gas purge (e.g,, 
Ar, H^/ He) can be used in conjunction to speed 

evacuation /removal of any excess first reactant. 

3. Second exposure 110, 210 : The substrate is then heated (or cooled) 
to a second temperature, T^ 136, 236, where T2 136, 236 is not equal 
to T, 132, 232. A second gaseous reactant is introduced 112, 212 
into the reactor chamber and onto the substrate. The first and 
second (chemi- or physi-sorbed) reactants react to produce a solid 
thin monolayer of the desired film. The reaction between the first 
and second reactants is self-limiting in that the reaction between 
them terminates after the initial monolayer of the first reactant is 
consumed. 

4. Second evacuation 126, 226: The excess second reactant is removed 
by again evacuating 126, 226 the chamber with the vacuum pump. 
An inert gas purge (e.g., Ar, H^, He) can be used in conjunction to 
speed evacuation /removal of any excess first reactant. The 
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substrate is then cooled (or heated) back to a first temperature, 
139, 239. 

5. Repeat: The desired film thickness is built up by repeating the 
entire process cycle (steps 1-4) many times. 
5 [0032] Additional precursor gases may be introduced and evacuated as 
required for a given process to create tailored films of varying compositions or 
materials. 

[0033] Preferably, 136, 236 is greater than 132, 232. The first 
temperature, T^ 132, 232, needs to be low enough so that the first reactant 

10 sufficiently forms a monolayer and does not decompose or desorb from the 
substrate. However, T2 136, 236 must be high enough in order to drive the 
deposition reaction and improve film purity. Typically, T^ 132, 232 can range 
from 20 ''C or lower up to 300 "'C, but more preferably less than 200 ""C, while T^ 
136, 236 can range from 200 ""C to 600 ""C or greater. The temperatures chosen 

15 depend on the reactants used and the types of films being deposited. Of course, 
T^ 132, 232 could be the ambient temperature state of the substrate initially and, 
as such, would not be initially heated. 

[0034] The temperature ramp up rate 134, 234 during heating, a, is 
preferably at least 200 °C/ sec, and more preferably, higher. The temperature 
20 ramp down rate 138, 238 during cooling, p, is preferably at least 100 ""C/sec, and 
more preferably higher. In practice, a is larger than p. 
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[0035] The methods of the present invention can toggle the substrate surface 
quickly between two or more temperatures to yield properly decoupled 
reactions. The energy may be delivered by ions, electrons, photons, or by a 
thermal energy means that primarily affects the top surface of the substrate 
5 undergoing deposition in a transient fashion. Sources for imparting such energy 
may come from rapid thermal processing (RTF) or laser irradiation. An electron 
beam may similarly be used. Any of these energy-inducing methods serve to 
cause a rapid, transient heating of the substrate. 

[0036] FIG. 3 shows an ALD system schematic incorporating a lamp array 
^fl 10 310 for rapidly heating a substrate 360 and a chilled ESC 370 for rapidly cooling 
''ll the substrate 360. Means for valving and controlling 345 the pressure of the 

, S bacl<side gas 340 are also shown. FIG. 3 shows a preferred method of heating the 

is substrate in the manner described herein by rapid thermal processing. RTF 

yi refers to a process in which the heating cycle is very rapid and is typically 

p. 15 performed via radiant heating 310 utilizing graphite heaters, plasma arc, 

tungsten halogen lamps, or other means well known in the art. This RTF system 
300 is coupled to the substrate 360 such that the substrate 360 surface is brought 
up to required temperature in seconds (as opposed to minutes for typical 
isothermal processes) with typical temperature ramp rates of 100-300 °C/sec. 
20 [0037] FIG. 4 shows an ALD system schematic incorporating a 

mechanically-scanned laser 410 (coupled with wafer 360 rotation) for rapidly 
heating a substrate 360 and a chilled ESC 370 for rapidly cooling the substrate 
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360. Means for valving and controlling 345 the pressure of the backside gas 340 
are also shown. FIG. 4 shows an alternative embodiment where an infrared (IR), 
ultraviolet (UV), or deep ultraviolet (DUV) laser 410 may be employed to rapidly 
heat a substrate 360 , whereby the beam is scanned rapidly over the entire area of 
5 the substrate 360. Alternatively, other forms of irradiation such as extreme- 
ultraviolet (EUV) or other radiation forms such as x-rays may be employed. 
[0038] The scanning means is accomplished by methods that are well 
known in the art. Alternatively, the substrate 360 can be moved with respect to 
the laser 410 (such as rotating the substrate with respect to a laser line source or 
10 point source) so that uniform irradiation of the substrate 360 will occur or the 
laser 410 may simply scan the entire surface. Laser 410 heating methods may 
locally heat a substrate 360 with temperature ramp rates of 200-700 ""C/sec or 
greater - typically, higher than RTP. 

[0039] These rapid heating methods reduce the overall thermal budget of 
15 the ALD process since the substrate 360 is only at a peak temperature for a short 
duration of time (on the order of seconds or less). This reduces the overall 
thermal budget of the process and enables the use of peak temperatures greater 
than if the substrate 360 was held at a constant temperature for longer periods of 
time. 

20 [0040] Regardless of the energy source used for heating the substrate 360 , 
the substrate 360 must be rapidly cooled, preferably through the use of a cooled 
pedestal. A cooled pedestal is a substrate 360 holder that retains the wafer 360 or 
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other substrate 360 against a cooled surface and introduces a ''backside'' gas 340, 
typically at pressures of 3-10 torr, as a thermal heat transfer medium in the space 
365 between them (i.e., the substrate and the cooled pedestal). Thermal coupling 
between the substrate and pedestal generally increases for increasing gas 
5 pressure, but saturates at an upper limit, typically around 10-20 torr depending 
on gas species, gap spacing, and geometry of the interface. Typical gases used 
are Ar and He. Although pedestals incorporating clamp rings for retaining the 
wafer can be used, electrostatic chucks 370, are preferred. ESCs 370 use 
electrostatic attraction to retain the wafer with a minimal substrate-pedestal gap 
10 distance and therefore attain better heat transfer than clamp ring systems. It is 
known that ESCs 370 can be designed with cooling capacities of approximately 
200 - 350 W/m'"K. 

[0041] In order to achieve the fast temperature ramp up rates discussed 
previously, the substrate 360 must at times be thermally decoupled from the 

15 cooled pedestal so that the energy input is not wasted in heating up the large 
thermal mass of the pedestal However, during fast temperature ramp downs, 
they must be coupled so that the cooled pedestal can efficiently remove heat 
from the substrate. Since the existence of a backside gas 340 is the primary 
means of heat transfer between the substrate 360 and the rest of the system in a 

20 vacuum or reduced atmosphere environment, a key part of this invention then is 
this state-based presence of the backside gas 340, or more specifically, the state- 
based pressure control 345 of the backside gas 340. With suitable valving and 
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pressure control 345^ application of wafer 360 backside gas 340 will facilitate 
thermal coupling between the wafer 360 and a cooled pedestal enabling the low 
temperature state. Valving off the backside gas 340 thermally decouples the 
substrate 360 from the cooled pedestal so that a high temperature state can be 
5 quickly achieved during RTP lamp 310 or laser 410 irradiation. This method is 
particularly effective in the semiconductor wafer processing industry since the 
thermal mass of the substrate 360 is very small, especially compared to a typical 
cooled pedestal/ and the heat fluxes are so large. This sequence is illustrated in 
FIG. 5. 

10 [0042] FIG. 5 shows substrate temperature 520 response to irradiation 500 
and state-dependent control of backside gas pressure 510. FIG. 5 illustrates how 
the backside gas 340 in conjunction with irradiation 500 can be used to cause the 
substrate temperature 520 to rapidly change from a low temperature to a high 
temperature and back to a low temperature state. The cooled pedestal can be 

15 cooled via chilled water, gases, or refrigerants to a steady state temperature near 
or significantly below room temperature (e,g., -40 °C to 20 °C). The pedestal can 
also be simply maintained at a desired low temperature state greater than room 
temperature by, for example, resistively warming the heater. In either case, the 
heat transferring backside gas 340 is used to toggle between the low temperature 

20 state 522 (backside gas 340 is "high" 514, e.g., 3-10 torr) of the pedestal to the 

high temperature state 524 (backside gas 340 is 'Tow'' 512, e.g., much less than 3 
torr) during RTP 310 or laser 410 irradiation. 
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[0043] It may be conceivable to perform the sequences described in FIGs. 1 
and 2 in separate RTF and ALD chambers. However, processing speed would be 
compromised. A preferable embodiment would be for the sequences described 
in FIGs. 1 and 2 to be carried out in a single chamber, 

[0044] From the description of the preferred embodiments of the process 
and apparatus set forth herein, it will be apparent to one of ordinary skill in the 
art that variations and additions to the embodiments can be made without 
departing from the principles of the present invention. 
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